Synthesized lengthening of function words - The fuzzy boundary between fluency and disfluency
نویسندگان
چکیده
As [1]’s model of speech production suggests, speakers sense upcoming difficulties and can correct them before uttering. A reasonable strategy to bridge resulting gaps is to prolong the words in the articulatory buffer [2]. This often buys enough time to correct the issue, resulting in standalone disfluent lengthening, after which fluency is resumed [3]. In case of more severe difficulties, the lengthening may be followed by other disfluencies such as silent or filled pauses or repetitions. Similar hesitation strategies might be useful in automatic speech production, e.g. for spoken dialogue systems that interact with human users and typically face a variety of challenges in natural language understanding and generation. Lengthening is an ambivalent phenomenon in speech that seems to be located at the fuzzy boundary between fluency and disfluency. It regularly occurs before phrase boundaries [4][5] and besides constitutes a common hesitation disfluency. Some disfluencies consist of lengthening [3] only, and some lengthenings appear so subtle that they pass unnoticed [6][7]. We assume that these characteristics of lengthening make it a key component in spoken dialogue systems that are capable of producing disfluencies, as they enable to buy a variable amount of time whilst being unobtrusive to the listener [6]. It is not yet known, however, how much synthetic lengthening is acceptable and how lengthening influences the user’s interaction with the system. To address these issues, this study tests the effects of step-wise increases of synthesized lengthening on user ratings and interaction speed.
منابع مشابه
Stuttering on function and content words across age groups of German speakers who stutter.
Recent research into stuttering in English has shown that function word disfluency decreases with age whereas content words disfluency increases. Also function words that precede a content word are significantly more likely to be stuttered than those that follow content words (Au-Yeung, Howell and Pilgrim, 1998; Howell, Au-Yeung and Sackin, 1999). These studies have used the concept of the phon...
متن کاملWhen disfluency is--and is not--a desirable difficulty: the influence of typeface clarity on metacognitive judgments and memory.
There are many instances in which perceptual disfluency leads to improved memory performance, a phenomenon often referred to as the perceptual-interference effect (e.g., Diemand-Yauman, Oppenheimer, & Vaughn (Cognition 118:111-115, 2010); Nairne (Journal of Experimental Psychology: Learning, Memory, and Cognition 14:248-255, 1988)). In some situations, however, perceptual disfluency does not af...
متن کاملNon-linguistic Influences on Rates of Disfluency in Spontaneous Speech
We investigate how non-linguistic factors influence rates of disfluency in spontaneous speech in a set of task-oriented dialogues (the HCRC Map Task Corpus). The factors we consider are: sex of the speaker; sex of the addressee; conversational role; ability to see the addressee; familiarity with the addressee; and practice at the task. Our analyses examined disfluency rate (the number of disflu...
متن کاملDeriving a strategy for synthesizing lengthening disfluencies based on spontaneous conversational speech data
Our overarching research project explores the usability of disfluencies in incremental spoken dialogue systems. This endeavor requires basic phonetic research on disfluencies in spontaneous speech corpora as to define strategies for synthesizing disfluencies in a meaningful way. In this paper, our current research focus lies in an investigation of disfluency-related lengthening as a promising t...
متن کاملPhrase-final rise-fall intonation and disfluency in Japanese - a preliminary study
In Japanese conversations, rise-fall intonation with vowel lengthening often occurs on the final syllable of a phrase. This phrase-final rise-fall (PFRF) is a new type of intonation first reported in the 1960’s. Researchers consider PFRF intonation a discourse marker which functions to sharpen the phrase boundary and retain the utterance turn, but other phrase-final intonation such as phrase-fi...
متن کامل